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On a theorem of Erdös and Simonovits on 
graphs not containing the cube 


Abstract: The cube Q is the usual 8-vertex graph with 12 edges. Here we give a new 
proof for a theorem of Erdős and Simonovits concerning the Turan number of the cube. 
Namely, it is shown that e(G) < n^ + (2n)*’? holds for any n-vertex cube-free graph G. 

Our aim is to give a self-contained exposition. We also point out the best known 
results and supply bipartite versions. 
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1 History of Turan type problems 


As usual, we write |G|, e(G), deg,.(x) for the number of vertices, number of edges, and 
the degree of a vertex x of a graph G. Denote by N,(x) (or just N(x)) the neighborhood 
of x, note that x € N(x). Let K, and K,, denote the complete graph on n vertices and 
the complete bipartite graph with bipartition classes of sizes a and b. K(A, B) denotes 
the complete bipartite graph with partite sets A and B(AN B= 9). 

A graph G not containing H as a (not necessarily induced) subgraph is called 

H-free. Let us denote by ex(n, H) the Turán number for H, i.e. the maximum number of 
edges of an H-free graph on n vertices. More generally, let ex(G, H) be the maximum 
number of edges in an H-free subgraph of G. Then ex(n, H) = ex(K,, H). We also use 
the notation ex(a, b, H) for ex(K,,,, H) and call it the bipartite version of Turan number. 
Also, if F c H then ex(n, F) < ex(n, H). 
Turan [29] determined ex(n, K pti) The extremal graph is the almost equipartite 
complete graph of p classes. He also proposed the general question, ex(n, H), in partic- 
ular the determination of the Turan number of the graphs obtained from the platonic 
polyhedrons, the cube Q = Q; (it is an 8-vertex 3-regular graph), the octahedron O, (six 
vertices, 12 edges), the icosahedron I,, (12 vertices, 5-regular) and for the dodecahe- 
dron D,, (20 vertices, 30 edges). Erdös and Simonovits [12] gave an implicit formula for 
ex(n, Og) (they reduced it to ex(n, C,)) and Simonovits solved exactly ex(n, D,,) in [26] 
and ex(n, I,,) in [27] (for n > ny). 

In fact, Turan’s real aim was not only these particular graphs but to discover a gen- 
eral theory. His questions, and the answers above, indeed lead to an asymptotic (the 
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Erdös-Simonovits theorem [10]) and to the Simonovits stability theorem concerning 
the extremal graphs [25] in the case when the sample graph H has chromatic number 
at least three. For asurvey and explanation see Simonovits [28] or the monograph of 
Bollobäs [5]. 

However, the bipartite case is different, see the recent survey [16]. Even the ex- 
tremal problem of the cube graph Q, which was one of Turän’s [30] originally posed 
problems, is still unsolved. Our aim here is to give a gentle introduction to this topic. 
We survey the results and methods concerning ex(n, Q), give new or at least stream- 
lined proofs. We only use basic ideas of multilinear optimization (Lagrangian, con- 
vexity, etc.) and in most cases just high school algebra. We also consider the case of 
bipartite host graph, i.e., ex(a, b, Q). 


2 Walks 


Let W, = W,(G) denote the number of walks in G of length 3, i.e., the number of se- 
quences of the form x)x,x,x3 where x;_,x; isan edge of G (fori = 1, 2, 3). Note that, e.g., 
xyxy is a walk (if xy € E(G)) and it differs from yxyx. A d-regular graph has exactly 
nd’ 3-walks. 


Theorem 1. For every n-vertex graph G for the number of 3-walks one has 


2 
W, > n( = X deg(x)”?) ; (1) 
n xeV 
The r-order power mean of the nonnegative sequence a,,...,a,, is M,(a) := 
(4 5 at)". Then for 1 <r < s < œ one has 
Ave = M, (a) < M, (a) < M,(a) < Mẹ (a) := max la;l. (2) 


We will frequently use it in the equivalent form 
r/s 
> a < (Ya) mils), GB) 
l<i<m 


This is just a special case of the Hölder inequality, i.e., for any two nonnegative vectors 
x,y € R” and for reals p,q > 1 with 2 + r = 1 one has 


1/p 1/q 
Sans) "(ER)" 
We get (3) by substituting here x = (a})ı «m» Y = (bL...,1), 1/p = r/s and 1/q = 
1-(r/s). 
Proofof Theorem 1. Considering the middle edge of the 3-walks one obtains that 


W, = 5 >, deg(x) deg(y). 


xEV yeN(x) 
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Here we have 2e = nd 
Schwartz inequality 


terms. We use for this sum the 2e-dimensional Chauchy- 


(ZA) LE) (Zar) 


valid for any two vectors a,b € R”. Our aim is to separate the variables in the products 


deg(x) deg(y) so we take a = {ydeg(x) deg(y)} xev, yen(x) and b = {V1/ deg(x)} rev, yen): 


One obtains that 


wyn=(¥ y deg(x) dego) )( X y = Pangea 


ave 


xEV yEN(x) xEV yeN(x) 
= ( $ dego)? ). m 
yev 


Historical remarks. One can rewrite Theorem 1 as 
W, >nM, na. (4) 
Then the power mean inequality (2) with (r,s) = (1, 3/2) gives that 


W; > n(d,,.)” = 8e°/n’. (5) 


This inequality W, > n(d,,.)° is due to Mulholland and C. A.B. Smith [20] and was 
generalized by Attkinson, Watterson and Moran [3] for W, for every k > 3 in a form of 
a matrix inequality. Then it was further generalized by Blakley and Roy [4] for all non- 
negative symmetric matrices. As far as the author knows the obvious consequence 
of their works, W, > n(d,,.)*, was first explicitly stated in a paper of Erdős and Si- 
monovits [13]. For the interested reader we supply a direct proof for (5) using only high 
school algebra in the Appendix (Section 8). 

Theorem 1 is not really new. It is an easy consequence (of a special case) of a re- 
sult of Jagger, Stovi¢ek, and Thomason [18], who while working on a conjecture of 
Sidorenko [24] showed the inequality X , w(x)!” > X, deg(x)”? where w(x) is the num- 
ber of 3-walks whose second vertex is x. 


The exponent 3/2 is the best possible. Consider a complete bipartite graph K,,, 
we have W, = 2a’b’. Then W,/nM,(d)’ > 0 for any fixed p > 3/2 whenever 
b/a > ow. 

For K,, we have W; = 2a”b’, while the right-hand side of (1) is a” b? (ary whichis 
between ab’ and 2a?b?. Using this observation one can show the following: Suppose 
that d,,...,d,, is the degree sequence of a graph G. There is a graph H with degree 
sequence d! for which d! > d; and W,(H) < 4n(¥(d})*/n)* = 4nM,,(d')’. 
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3 3-paths in bipartite graphs 


Let P, = P,(G) denote the number of 3-paths of G. We have 2P; < W,. Using the method 
of the previous section we show the following lower bound for P}. 


Theorem 2. Let G(A, B) be a bipartite graph with e edges and with color classes A and 

B, |A| = a, |B| = b. Suppose that every vertex has degree at least 2. Then for the number 

of 3-paths one has 

e(e — a)(e- b) 
ab i 


Proof. Considering the middle edge of the 3-paths one obtains that 


P; > (6) 


P,=) }_ (deg(x) - 1)(deg(y) - 1). 


xEA yEN(x) 


Here we have e terms. One obtains that 


aP,=a) ). (deg(x) - 1)(deg(y) - 1) 


xEA yeN(x) 


=ay Y -(degyp)-1)+a). ), deg(x)(deg(y) - 1) 


xEA yeN(x) xEA yeN(x) 


1 
= -a Y dest y)deg(y) -D + (X I go È deasga- n). 


xEA yeN(x) xEA yeN(x) 


Here the second term is at least 


ydeg(x)(deg(y) — 1) \? 2 
DA Aat -AZA t AA _ deg(y) — 1 
ee) EN) 


- ( $ deg(y) ydeg(y) - i). 
yeB 


Let F(y,, Y2» - - -> Y) be a real function defined as 


=A z (y? = y) + (Zr), 


1si<b 


> 


where y; > 2 and Y y; > 2a. We obtained that aP, > F(y) where y € R” is the vector 
with coordinates formed by the degrees {deg(y) : y € B}. We will see that F is con- 
vex in this region, hence all y; can be replaced with the average of the degrees, i.e., 
È „eg deg(y)/b = e/b. One obtains 


2 
efe e le 
aP; > -abt (2 - 1) + (e2 - 1) ; 
$ b\b b \b 
Rearranging one gets (6). 
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Proof of convexity. Let F 
fori # j one has 


ip Fu denote the partial derivatives, H the Hessian of F. Then 


3y,-2 3y,-2 
eee za 2 > 
2yy-1,jy,-1 
and 
1 =>) 3y, -1 
Fy = -2a + Gly) SATIN Gq) —2 


+ | 
yi-1l 2(y, - 1) Vy¥j- 1 


1(3y;-2)? 13y,-2 3y,-2 
> -2a + (Gly) - per =: 


M1 2 wi-1vwyi- 


where G(y) = X y; Vy; — 1. Since (3y-2)”/2(y-1) = 6fory > landG(y) 2 X y; =e > 2a 
we can write H as a sum of a positive semidefinite matrices, namely 1/2 times the 
tensor product of the vector { 3} with itself, and a diagonal matrix with diagonal 
entries exceeding -2a + 6(G(y) - 1), again a positive definite matrix. Thus H is positive 
definite and then F is convex in the region. m 


The above theorem is a slightly improved version of a result of Sidorenko [23] 
which states P} > e’/ab- Ae, where Ais the maximum degree of & a general 
(non-bipartite) graphs, Theorem 1 implies that P} > in(dwe) - 5 - dye. This inequal- 
ity may also be deduced from a Moore-type bound, established a Alon, Hoory and 
Linial [2]. 


4 Graphs without C, 


Theorem 3. Let G(A, B) be a bipartite graph with e edges and with color classes A and 
B, |A| = a, |B| = b. Suppose that G has girth eight. Then for the number edges one has 


e < (ab)? +a+b. (7) 


Proof. We use induction on the number of vertices if there is any isolated vertex, ora 
vertex of degree 1. Otherwise, observe, that every pair x € A, y € Bis connected by at 
most one path of length 3. Thus P, < ab. Comparing this to the lower bound for P, in 
(6) and rearranging we get the Theorem. m 


D. de Caen and Székely [7] showed that e(G) = O((ab)?’?) assuming a = O(b?) and 
b = O(a’). Later they showed [8] that if G has girth eight and every vertex has degree 
at least two, then e < 2!/3(ab)?/> and here the coefficient 2!/° is the best possible by 
exhibiting a graph with a = 2s, b = s* ande = 2s”. (Note that this does not contradict 
our result (7) since here b = e/2). 

Győri [17] observed that in a C,-free graph G the maximal complete bipartite 
graphs K,,,’s with a, p > 2 are edge disjoint (indeed, these are K, g's). Thus one can 
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remove edges from G such that the resulting graph G, is C,-free and e(G,) = 5e(G). 
Thus Györi’s result combined with Theorem 3 gives that 


Corollary 4. If G is a C,-free bipartite graph with parts of sizes a and b then e(G) < 
2(ab)?? + 2a + 2b. m 


More is true. In [14] it was proved that for such a graph 
e(G) < 2" (ab)? + 16(a + b) (8) 


holds. Moreover infinitely many examples show that the coefficient 2!/° in the best 
possible for large a and b with b = 2a. 

Concerning general (not necessarily bipartite) graphs, it was proved by Bondy and 
Simonovits [6] in 1974 that a graph on n vertices with at least 100kn'*!/* edges con- 
tains C,,, a cycle of length 2k. This was extended into bipartite graphs with parts of 
sizes of a and b by G. N. Särközy [22] who showed that such a graph with max{90k(a + 
b), 20k(ab)'*"/*} edges contains a C,,. Our Corollary 4 gives these for C,, even a slightly 
better statement, using the following important reduction theorem. 


Lemma 5 (Erdős [9]). Let G be anarbitrary graph. Then there exists a bipartite subgraph 
Gy with degg, (x) = 5 deg.(x) for all vertices. Especially,e(G,) > 5e(G). 


Corollary 6. IfG is aC,-free graph on n vertices then e(G) < 2?!’n*!? + 4n. o 


It is known that there are C,-free graphs with at least (+ + 0(1))n*”* edges [19], and 
the best known lower and upper bounds can be found in [14], (namely 0.5331? < 
ex(n, Cg) < 0.628n*’ for n > no). Yuansheng and Rowlinson [31] determined ex(n, C) 
and all extremal graphs for n < 26. 


5 Cube-free graphs 


Theorem 7 (Erdös and Simonovits [11]). Let Q denote the 8-vertex graph formed by the 
12 edges of a cube. Then ex(n, Q) < O(n®’*). 


The original proof of this is rather complicated. It applies a remarkable regulariza- 
tion process for non-dense bipartite graphs. A somewhat simpler proof was found by 
Pinchasi and Sharir [21], who were interested in certain geometric incidence problems, 
and who also extended it to a bipartite version 


e(G(A, B)) < O((ab)*> + abl? + ald). (9) 


Here we give an even simpler proof which also gives the bipartite version, see (10) 
below. We only use Theorem 2, Corollary 4 and the power mean inequality (3), but the 
main ideas are the same as in [11]. 
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Proof of Theorem 7. Let G be an n-vertex Q-free graph. First, applying Erdős’ Lemma 5 
we choose a large bipartite subgraph G(A, B) of G, e(G) < 2e(G(A, B)). 

We say that a hexagon z,z,...z, lies between the vertices x and y if z,,z;,z, are 
neighbors of x and the other vertices of the hexagon are neighbors of y, i.e., z,, Z3, Z5 € 
N(x) and z,,2,,2; € N(y) and {x, y} N {z,,...,% } = Ø. The crucial observation is that 
x and y together with the 6 vertices of a hexagon between them contain a cube Q. So 
there is no such hexagon in a Q-free graph. Thus we can apply the upper bound for 
the Turan numbers of C,, i.e., Theorem 4 and obtain an upper bound for the number 
of edges uv, u € N(x), v € N(y). This gives an upper bound for the number of paths 
with end vertices x and y. For given x and y we have 


#xuvy paths = |{uv € E(G(A, B)) : u € N(x) \ {yh v € N(y) \ {x} 
< 2|N(x)P? INPP + 2N) + 2N). 


Add this up for every x € A, y € B. Let e := e(G[A, B]) and use (3) with (r, s) = (1, 3/2). 
We have 


P,(G(A,B)) < ), ), 2deg(x)”? deg(y)”? +2 deg(x) + 2 deg(y) 


xeA yeB 
= 2( > deg(x)"!”)( $ deg(y)?”’) + 2be + 2ae 
xEA yeB 


< 2x eg? y ebb! + 2(a + b)e. 
Comparing this to the lower bound in Theorem 2 one obtains that 


(e - a)(e - b) < 2e!’ (ab)*? + 2(a + b)ab. 


This implies that 
e < 2’ (ab) + 2ab'” + 2a'b, (10) 
Using e(G) < 2e we obtain 
e(G) < n?” + (2n)?”? (11) 
finishing the proof. m 


If we use (8) instead of Corollary 4 then the above calculation gives 


Theorem 8. 
ex(a, b, Q) < 2 °(ab)*? + Hab!” + a!!?v) (12) 


and 
ex(n, Q) < 2’ + 13n°?. (13) 
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6 A lower bound on the number of C,,’s 


Let N(G, H) denote the number of subgraphs of G isomorphic to H. This function is 
even more important than the original Turan problem. Here we consider only one of 
the simplest cases, H = Cy. 

It was observed and used many times that for sufficiently large e the graph G con- 
tains at least Q(e*/n*) copies of C,. This result goes back to Erdős (1962) and was pub- 
lished, e.g., in Erdős and Simonovits [11] in an asymptotic form (N(G, C,) > (1/ 3)e*/n? 
forn > Cn’). The following simple form has the advantage that it is valid for arbitrary 
nande. 


Lemma 9 (see [15]). Let G be a graph withe edges and n vertices. Then 


3 2 4 
e(e-n) e e 3 
N(G, C,) 2 2 —— - — 2 2— - =en. 14 
(G Ca) nt 2n m g” 04) 
Allen, Keevash, Sudakov, and Verstraéte [1] gave a bipartite version of Lemma 9. 
Here we state their result in a slightly stronger form (it is valid for all values of a, b and 


e). Note that the formula is not symmetric in A and B. 


Lemma 10. Let G be a bipartite graph with parts A and B of sizes a and b and e edges. 
Then the number of 4-cycles in G is at least 


e’(e — b} — e(e — b)ba(a - 1) 


4b?.a(a — 1) (15) 


For completeness we present the proofs of the above Lemmas (below and in the 
Appendix). But we will need a slightly stronger and more technical version. 


Lemma 11. Let G be a bipartite graph with parts A and B of sizes a and b and e edges. 
Let D(x) denote È yen (deg( y) - 1). Then the number of 4-cycles in G is at least 


m. X Dw’) a H X PW), (16) 


Proof. We have 


ean r aa ("| 


{x,x'}CA 
= 1 Er De \x d(x, x')/(a- 1) 
2 2,0 D( : 2 ) 
_a-l D(x)/(a- 1) 
a) 
_ 1 2\ 1 
NET (2,0% 3 (22%) m 
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Note that Lemma 11 easily implies Lemma 10. Indeed, observe that for e(e — b) < 
ba(a - 1) the right-hand side of (15) is negative, so we may suppose that (e?/b) -e > 
a(a - 1). Use Chauchy-Schwartz for J,,., D(x). We obtain 


xeA 


2 
yD) => ( yd Gdeg(y) - 1)) = J deg(y)’ - È deg(y) > = e. 
yeB 


xeA xeA  yeB,xyEE(G) yeB 


Use Chauchy-Schwartz again for ¥ D(x)?. We have 


1 2 
D(x)’ = -( Y D(x)). 
YP? = 2( X pw) 
Now Lemma 11 gives that N(G,C,) = (N? /4a(a - 1)) - (N/A) for N := Xea D(x). Since 
N > (e?/b) -e > a(a - 1) the polynomial p(N) = N? /a(a- 1) - N is increasing and we 
get N(G,C,) = p(N) = p(e(e - b)/b)). m 


7 Cubes with a diagonal 


Theorem 12 (Erdös and Simonovits [11]). Let Q* denote the 8-vertex graph formed by 
the 12 edges of a cube with a long diagonal. Then ex(n,Q*) < O(n®’*). 


Here we give a simpler proof which also gives a stronger bipartite version. 
Theorem 13. ex(a, b, Q*) < 2°/°(ab)*> + O(ab"? + a™’?b). 


Using again Erdős’ Lemma 5 and a + b = n we get 
ex(n,Q*) < nb + OP). (17) 


Proof of Theorem 13. Let G be an n-vertex Q*-free bipartite graph with classes A and 
B. The main idea is the same as in [11] and in the proof of Theorem 7. The crucial ob- 
servation is that an edge xy € E(G) together with the 6 vertices of a hexagon between 
them form a Q*. So there is no such hexagon in a Q*-free graph between the neigh- 
borhoods of two connected vertices. Thus we can apply the upper bound for the Turan 
numbers of C,, i.e., Theorem 4 and obtain an upper bound for the number of edges 
x'y', y! € N(x), x’ € N(y) for xy € E(G). This gives an upper bound for the number of 
four cycles containing the edge xy. 


#xy'x'y four cycles = |{x' y" € E(G(A, B)): y' € N(x) \ {y} x" € N(y) \ {x} 
< 2(IN(x)| - DIN) - D7? + 21N(x)| + 2IN(y)I = 2. 
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Add this up for every x € A, y € B, xy € E(G) and apply (3) for Y,,<p(deg(y) - 1)7” 


with (r, s) = (1, 3/2) for every x. We obtain 


4N(G,Cy) < YY 2(deg(x) - 1)?” (deg(y) - 1)°° +2 deg(x) + 2 deg(y) - 


xEA yeN(x) 


a =2( Y (deg(x) - 1° deg(x)'D(x)"”") + 2( 2 Dv) + 2. > Dw); 


xeA 


Apply Holder inequality with 1/p = 2/3 and 1/q = 1/3 in the first term. We obtain it is 
at most 


2/3 1/3 
< 2( X (deg(x) - 1) deg(x)'”) (DW) . (18) 


xeA xeA 


From now on, to save time and energy, and to better emphasize the main steps of 
calculation we only sketch the proof. Compare the obvious leading terms in the lower 
and upper bounds (16) and (18) for N(G, C,), we have 


zal 2 Dy ) « 4N(G,C,) « 2( X (dena) = D dego!) (Epe) 


yielding 
( X Dw?) < (21a - nà F Gego - 1) deg(x)""), (19) 


xeA 


On the left-hand side we can use Cauchy-Schwartz and on the right-hand side we ap- 
ply (3) with (r,s) = (3/2,2). We obtain 


-( D(x) < Qla- 1)"a'( Y (deg) - TO 
xEA xEA 
Rearranging we have 
( > D(x) < 2a ee X 20) (20) 


xeA yeB 


Exchange the role of A and B, we get 
2 3/4 
(¥ pw) <27 Y De») 
yeB xeA 
Multiply the above two inequalities, take 4" power, we get 


22(ab)!! > ( 3 DW) ( $ Diy) > 2(£ = ej (Ż — e) 


xEA yeB 


leading to 2'?(ab)'° > e”. o 
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8 Appendix 


(1) A direct proof of the Mulholland-Smith inequality (5) concerning the number of 
3-walks using only high school algebra. 
Considering the middle edge of the 3-walks one obtains that 


W, = $ > deg(x) deg(y). 


xEV yeN(x) 


Here we have 2e = ndwe terms. Our aim is to separate the variables in the prod- 
ucts deg(x) deg(y) so next we use first that the 2e-dimensional quadratic inequality 
(quadratic mean is greater than or equal the arithmetic mean), second we use (for 2 
variables) that the arithmetic mean is greater than or equal the harmonic mean, then 
third time we use again (this time for 2e variables) that arithmetic > harmonic. One 
obtains that 


nd 2e 


Ixev ven) V deg(x) deg(y) 


2e 


O D a 


xEV yeN(x) deg(x) E deg(y) 


ave 


IV 


> 2e 


1 1 
deg(x) + deg(y) i 2e 2e = 


= = ave 
xEV yeN(x) 2 È xev 1 n 


(2) Proof of Lemma 9 concerning the number of C,’s. 
Denote the number of x, y-paths of length two by d(x, y). We have 


I(T 5 nl)" ZEN) 0 


x,yeV(G) xeV(G) 


Therefore, d > ria, Moreover 


1 d(x, 1 d 
NG,C)=> ¥ ( Z a > ZCN ). (22) 
Deo 52 2\2/\2 
We may suppose that the middle term in (14) is positive, which implies that en > 


1/2. The paraboloid (3) is increasing for x > 1/2. So we may substitute the lower bound 


of d from (21) into (22) and a little algebra gives (14). o 
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